THE WORD PROBLEM IN THE BAUMSLAG GROUP WITH A 
NON-ELEMENTARY DEHN FUNCTION IS POLYNOMIAL TIME 

DECIDABLE 

ALEXEI MIASNIKOV, ALEXANDER USHAKOV, AND DONG WOOK WON 



Abstract. We prove that the Word problem in the Baumslag group C?{i,2) = 
(^a,b ; a" = a^^ which has a non-elementary Dehn function is decidable in 
polynomial time. 

Keywords. Word problem, one-relator groups, Magnus breakdown, power 
circuits, computational complexity. 

2010 Mathematics Subject Classification. 20F10, 11Y16. 

1. Introduction 

One-relator groups form a very interesting and very mysterious class of groups. 
In 1910s Dehn proved that the Word problem for the standard presentation of the 
fundamental group of a closed oriented surface of genus at least two is solvable 
by what is now called Dehn's algorithm (see [21] for details). In 1932 Magnus 
developed a general powerful approach to one-relator groups [19] , nowadays known 
as Magnus break-down procedure (see (5DJ[TH]). In particular, he solved the Word 
problem (WP) in an arbitrary one-relator group. The decision algorithm is quite 
complicated and its time complexity is unknown. In fact, we show here that the 
time function of the Magnus decision algorithm on the Baumslag group 

G(i.2) — {o.,b I b^^a^^bab^^ab — a^^ 

is not bounded by any finite tower of exponents. Furthermore, it is unknown 
whether there exists any feasible general (uniform) algorithm that solves WP in all 
one-relator groups, and at present it seems implausible that such algorithm exists. 
However, it is quite possible that the Word problem in every fixed one-relator group 
is tractable. In the Magnus collection of open problems in groups theory [5] the 
following question is posted. 

Problem 1.1 ([5], (0R3)). Is it true that WP in every given one-relator group G 
is decidable in polynomial time? 

The current state of affairs on WP in one-relator groups can be described as 
follows. On one hand, there are several large classes of one-relator groups where 
WP is well understood and is decidable in polynomial time (hyperbolic, automatic, 
linear, etc). On the other hand, there are several sporadic examples of one-relator 
groups where WP requires a special treatment, though at the end is polynomial 
time decidable. Finally, there is a few one-relator groups where WP seems especially 
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hard and the time complexity is miknown. These are the most interesting ones in 
this context. 

One of the principal unsolved mysteries on one-relator groups is which of them 
have a hard WP and why. More precisely, the problem is to determine the "general 
classes" of one-relator groups and divide the rest (the sporadic, exceptional ones) 
into some well-defined families. 

There are several conjectures that describe large general classes of one-relator 
groups which wc would like to mention here. 

1.1. Hyperbolic groups. Notice, that if G is hyperbolic, in particular, if it sat- 
isfies the small cancelation condition C"(^), then WP in G is decidable in linear 
time by Dehn's algorithm [T3]. Since the asymptotic density of the set of words 
w G F{X) for which the symmetrized one-relator presentation {X \ w) is C"(i) 
small cancelation is equal to 1, one may say that for generic one-relator groups 
the answer to the question above is affirmative. One can check in polynomial (at 
most quadratic) time if a one-relator presentation, when symmetrized, is C"(^) or 
not. Hence, it is possible to run in parallel the Magnus break-down process and the 
Dehn's algorithm for symmetrized C"(i) presentations and obtain a correct uniform 
total algorithm that solves WP in one-relator groups, and has Ptime complexity on 
the set of one-relator groups of asymptotic density 1. Unfortunately, such an algo- 
rithm will not be feasible on the most interesting examples of one-relator groups. 
Some interesting examples of hyperbolic one-relator groups can be found in | 15j . 

Of course, not all one-relator groups are hyperbolic. The famous Baumslag- 
Solitar one-relator groups 

B{m,n) = (a,& I b'-'^a'^b^ a''),m,n> 1, 

introduced in [B] are not hyperbolic, since the groups B(^i n) are infinite metabelian, 
and the other ones contain F2 x Z as a, subgroup. 

The following outstanding conjecture (see [5]) describes, if true, one-relator hy- 
perbolic groups. 

Problem 1.2. Is every one-relator group without Baumslag-Solitar subgroups hy- 
perbolic? 

Independently of the above, it is very interesting to know which one-relator 
groups contain groups 

Problem 1.3. Is there an algorithm to recognize if a given one-relator group con- 
tains a subgroup -B(m,n) for some m,n>17 

Notice, that in 1968 B. B. Newman in [5J showed that all one-relator groups 
with torsion are hyperbolic and, hence, the Word problem for them is decidable in 
linear time. 

1.2. Automatic groups. Automatic groups form another class where WP is easy. 
It is known that every hyperbolic group is automatic and WP is decidable in at 
most quadratic time in a given automatic group. Furthermore, the Dehn function 
in automatic groups is quadratic. We refer to |llj for more details on automatic 
groups. Observe, that the group i?(„i^„) is not automatic provided m ^ n, since its 
Dehn function is exponential. 



WORD PROBLEM FOR THE BAUMSLAG GROUP G(i,2) 3 

The main challenge in this area is to describe one-relator automatic groups. An- 
swering the following questions would help to understand which one-relator groups 
are automatic. 

Problem 1.4. Is it true that one-relator groups with a quadratic Dehn function 
are automatic? 

Problem 1.5. Is it true that one-relator groups with no subgroups isomorphic to 
B(m.n)i m ^ n are automatic? 

Problem 1.6 ([5] (0R8)). Is the one-relator group {X \ [u,?;]) automatic for any 
words u, w G F{X)? 

1.3. Linear and residually finite groups. Lipton and Zalstein in [T7] proved 
that WP in linear groups is polynomial time decidable, so one-relator linear groups 
provide a general subclass of one-relator groups where WP is easy. Until recently, 
not much was known about linearity of one-relator groups. We refer to [3] for an 
initial discussion that formed the area for years to come. The real breakthrough 
came in 2009 when Wise announced in [55] that if a hyperbolic group G has a 
quasi-convex hierarchy then it is virtually a subgroup of a right angled Artin group 
and, hence, is linear. This result covers a lot of one-relator groups, in particular all 
one-relator groups with torsion. There are two interesting cases that we would like 
to mention here. In [T Baumslag introduced cyclically pinched one-relator groups 
as those ones that can be presented as a free product of free groups with cyclic 
amalgamation 

{XUY\u = v)= F{X) F{Y) 

where u € F{X) and v G F{Y) are non-trivial non-primitive elements in the cor- 
responding factors. Similarly, one can define conjugacy pinched one-relator groups 
as HNN extensions of free groups with cyclic associated subgroups: 

{F{X),t I t-^ut ^ v) 

Wehrfritz proved in 27 that if non of u and v are proper powers then the group 
F{X) *u=v F{Y) is linear. However, it was shown in [71 [TB] that if either u or u is 
not a proper power then the group F{X) F{Y) is hyperbolic, so WP in these 
groups is linear time decidable. Similar results hold for conjugacy pinched one- 
relator groups as well. Observe, that cyclically and conjugacy pinched one-relator 
hyperbolic groups have quasi-convex hierarchy, so their linearity follows from Wise's 
result. On the other hand, WP in hyperbolic groups is easy anyway, so linearity in 
this case does not give much in terms of the efficiency of WP. 

The general problem which one-relator non-hyperbolic groups are linear is wide 
open. Recall, that every finitely generated linear group is residually finite. Hence, 
to see that a given one-relator groups is not linear it suffices to show that it is not 
residually finite. 

Notice that there is a special decision algorithm for WP in residually finite 
finitely presented groups. The algorithm when given such a group {X \ R) and 
a word w G F(X) runs two procedures in parallel: the first one enumerates all the 
consequences of the relators R until the word w occurs, in which case w = 1 in G; 
while the second one checks if w is non-trivial in some finite quotient of G. Since R 
is finite and G is residually finite, one of the two procedures eventually stops and 
gives the solution of WP for w. However, this algorithm is extremely inefficient. 
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This is why we do not discuss residually finite one-relator groups as a separate class 
here, but only briefly mention the results that are related to linearity. 

Meskin in [22^ studied residual finiteness of the following special class of one- 
relator groups: 

B{u, V, m, n) — (^X \ u^^v"^u = f"), m, n > 1, 

where u and v are arbitrary non-commuting elements in F{X). He showed that if 
m ^ l,n =^ l,m =/= n then the group B{u, v, m, n) is not residually finite. It follows 
that the group -B(m,n) is residually finite if and only if m = 1, or n = 1, or m = n. 

Later Vol'vachev in j26| found linear representation for all residually finite groups 
B{u,v,'m,n). Sapir and Drutu constructed in |10j the first example of residually 
finite non-linear one-relator groups. They showed that the group 

DS = {a,t\ t-^at^ = a^) 

is residually finite and non-linear. 

The general classes of one-relator groups described above are the only known 
ones where WP is polynomial time decidable. Now we describe the known sporadic 
one-relator groups where WP is presumably hard or requires a special approach. 

1.4. Baumslag-Solitar groups. Gersten showed that the groups i3(m,n)j where 
m n, have exponential Dehn functions [12] (see also [11] and 0), so they are not 
hyperbolic or automatic. As we mentioned above the metabelian groups are 
linear, so WP in them is polynomial time decidable. The non-metabelian groups 
B[m,n) are not linear, so WP in them requires a special approach. Nevertheless, WP 
in these groups is polynomial time decidable (see Section [2]). It would be interesting 
to study WP in the groups _B(u, w, to, n) which are similar to the Baumslag-Solitar 
groups. 

Problem 1.7. What is complexity of WP in the groups B{u, v, m, n) ? 

1.5. Baumslag group G(i,2)- The group G(i_2) = (a, 6 | b^^a^^bab^^ab — a^^ is 
truly remarkable. Baumslag introduced this group in ^ and showed that all its 
finite quotients are cyclic. In particular, the group G(i^2) is not residually finite 
and, hence, is not linear. In [T3] Gersten showed that the Dehn function for G(i_2) 
is not elementary, since it has the lower bound tower2(log2(n)) and later Platonov 
in [25] proved that tower2{log2{n)) is exactly the Dehn function for G(i_2)- This 
shows that G(i^2) is not hyperbolic, or automatic, or asynchronously automatic. It 
was conjectured by Gersten that G(i_2) has the highest Dehn function among all 
one-relator groups. As we have mentioned above the time function for the Magnus 
break-down algorithm on G(i 2) is not elementary. Taking this into account it was 
believed until recently that WP in G(i 2) is among the hardest to solve among all 
one-relator groups. In this paper we show that the Word problem for G(i 2) can 
be solved in polynomial time. To this end we develop a new technique to compress 
general exponential polynomials in the base 2 by algebraic circuits (straight-line 
programs) of a very special type, termed power circuits [23j . We showed that one 
can do many standard algebraic manipulations (operations x + y,x — y,x-2y,x < y) 
over the values of exponential polynomials, whose standard binary length is not 
bounded by a fixed towers of exponents, in polynomial time if it is kept in the 
compressed form. This enables us to perform some variations of the standard 
algorithms in HNN extensions (or similar groups) keeping the actual rewriting in 
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the compressed form. The resulting algorithms are of polynomial time, even though 
the standard versions are non-elementary. 

1.6. Baumslag groups The approach outlined above is quite general and 
we believe it can be useful elsewhere. In particular, it works for groups of the type 
G(ra,n)^ whcrc TO dividcs n. Here the groups G(m „) are defined by the following 
presentations: 

Unfortunately, we do not have any compression techniques for the case when to does 
not divide n and n does not divide to. So the following problem seems currently as 
the main challenge regarding WP in one-relator groups. 

Problem 1.8. What is the time-complexity of the Word problem for G(2.3)? 

1.7. Generalized Baumslag groups. In 4J Baumslag, Miller and Troeger stud- 
ied another series of one-relator groups G(r, w) which are similar to the group G(i 2) ■ 
Namely, if r,w are two non-commuting words in F{X) then put 

G{r,w) = {X I r"^" = r^). 

The group G{r,w) is not residually finite (neither linear nor hyperbolic), it has 
precisely the same finite quotients as the group {X \ r). These groups surely 
among the ones with non-easy WP. 

Problem 1.9. Let r,w be two non-commuting elements in F{X). 

1) What is the Dchn function of G(r, w)? 

2) What is time complexity of WP in G{r,w)7 

Going a bit further one can consider WP in the following groups 

G{r,w,m,n) = (X \ (r™)'''" = r"). 

The paper is organized as follows. In section [2] we discuss algorithmic properties 
of elements of G(i.2) as an HNN extension of the Baumslag-Solitar group, set up the 
notation, and outline the difficulty of solvig the Word problem using the standard 
methods for HNN extensions. In Section |3] we define the main tool in our method, 
namely the power circuits, and present techniques for working with them. In Section 
U] we define a representation for words over some alphabet which we call a power 
sequence. In Section [5] wc present the algorithm for solving the Word problem in 
G(i 2) and prove that its time-complexity is OirJ). 

2. The group G(i.2) 

In this section we represent the group G(i 2) as an HNN extension of the Baumslag- 
Solitar group 2) and describe two rewriting systems TZ and TV to solve WP in 
G(i,2)- The system TZ represents the classical Magnus breakdown algorithm for 
G(i,2)- To study complexity of rewriting with TZ we construct an infinite sequence 
of words {wfe} such that 

• it takes at least tower^ik — 1) steps for TZ to rewrite Wk] 

• TZ rewrites Wk into a unique word of length tower2{k). 
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This shows, in particular, that the time funetion of the Magnus breakdown algo- 
rithm on 0(1^2) is not bounded by any finite tower of exponents. Our strategy to 
solve WP in G(i 2) can be roughly described as follows. We combine many ele- 
mentary steps in rewriting by TZ into a single giant step and make it an elementary 
rewrite of a new system TV . It is not hard to see that now it takes only polynomially 
many steps for TV to solve WP in G'(i,2)- In the rest of the paper we show that 
every elementary rewrite in TV (the giant step) can be done in polynomial time in 
the length of the input, thus proving that WP in G'(i,2) is decidable in polynomial 
time. 

2.1. HNN extensions. The purpose of this section is to introduce notation and 
the technique that we use throughout the paper. 

Let H he a, group with two isomorphic subgroups A and B, and : A B aw 
isomorphism. Then the group 

G = (H, t I a* = Lp{a) for each a A) = {H,t \ A* ^ B) 

is called the HNN extension of H relative to ip. We refer to [18] for general facts 
on HNN extensions. The letter t is called the stable letter. If H is generated by a 
set Y then Y U {t} generates G and any word w in the alphabet {Y U {i})^^ can 
be written in the syllable form: 

w{H,t) = /ioi^'/iit^^/i2 ■ . .t^"/i„ 

where = ±1 for each i = 1, . . . ,n and hi are words in the alphabet Y^'^ . The 
number n is called the syllable length oiw = w{H,t) and denoted by \w\t. A pinch 
in w is a subword of the type t~^ht with ft, G A or a subword tht^^ where h E B. 
A word w is reduced if it is freely reduced and contains no pinches. 

Theorem (Britton's lemma, 8J. Let G = {H,t \ A^ = B). If a word 

w{H,t) = hQe^hit^^h2 . ..t^^hn 

represents the trivial element of G then cither n — Q and w —h 1 or w{H,t) has a 
pinch. 

Corollary. Let G ^ {H,t \ A* B) . Assume that 

(Gl) The Word problem is solvable in H. 

(G2) The Membership problem is solvable for A and B in H. 

(G3) The isomorphisms ip and ip~^ are effectively computable. 

Then the Word problem in G is solvable. 

Proof. The decision algorithm that easily comes from the Britton's lemma can be 
described as rewriting with the following infinite rewriting system TZhnn'- 

{t~^ht ^ (fih) I h e F{Y) and h e A} (J 
(1) {tht-^ ^(t)-^{h)\heF{Y) andheBjU 

{ft -> e I ft e F{Y) and ft =h 1} 

where s is the empty word. 

□ 
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2.2. The group G(i 2) and Magnus breakdown. 

Proposition 2.1. Let G(i_2) = b~^a~^bab^^ab — a?) . Then the following 

holds: 

1) The group G(i 2) is a conjugacy pinched HNN-extension of the Baumslag- 
Solitar group -6(1,2) — {o-,'t\ t^^at = a^) with the stable letter b: 

Gii,2)^{Bii.2),b\b-'ab^t) 

2) An infinite rewriting system TZ: 

{t-^a''t ^ a^*^ I fc e Z} U {ta^'^t-i a'' \ k e 1} U 
(2) {b-^a''b ^ i'^ I fc e Z} U {bt>'b-^ ^ a*-- I fc G Z} U 

{aa^^ — !■ e, a^^a — > e, 56^^ — ^ e, 6^^6 — > e} 

is terminating and for any w — w{a, b), 

w —G I -i^ w £. 

In particular, TZ gives a decision algorithm for the Word problem in G(i,2), 

Proof. Notice that 

G(i 2) ^ {a,b\ b-^a-'^bab-^ab = a^) = {a, t, b \ f-'^at = b-'^ab = i) = 

= (S(i,2),6|6-ia6 = t) 

which proves 1). 

To prove 2) observe first that the groups 2) and G(i_2) are HNN extension, 
which satisfy the properties (Gl), (G2), and (G3) from Corollary 12.11 Hence WP 
in both groups can be solved by the corresponding rewriting systems of the type 
TZhnn from the proof of CoroUarv 12.11 Combining these rewriting systems into 
one we obtain the system TZ. It follows that for any w — w{a, b), 

w =G 1 <J=> w — >7j £. 

It remains to be seen that TZ is terminating. To see this associate with each word 
w — w(a, 5, t) a triple (a, /3,7) where a is a total number of b symbols in w, /3 is 
the total number of t symbols in w, and 7 = \w\. It is easy to see that any rewrite 
from TZ strictly decreases {a,/3,j) as an element of N'^ in the (left) lexicographical 
order. Now termination of TZ follows from the fact that 7^ is a well-ordering. □ 

Notice, that the system TZ is not confluent in general. 

Proposition 12 . II states that TZ solves the Word problem for G(i 2), but it does not 
give any estimate on the time-complexity of the rewriting procedure. To estimate 
the complexity of rewriting with TZ consider a sequence of words over the alphabet 
of G(i,2) defined as follows 

= a 

wi = {b^^waby^a{b^'^wob) 

w^+i = {b^^Wiby^a{b^''-w,b) 

Lemma 2.2. Let G — G(i_2) — {a,,b \ b^^a~^bab^^ab — a^'). Then (in the notation 
above) the following holds: 

1) for any i 
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2) ato™e''2(2) ig ffig only TZ-reduced form of Wi. 

3) it takes at least tower2{i — 1) elementary rewrites for TZ to rewrite wi into 

^tower2{i) 

Proof. By induction on k 

.-2 I ,2 I 2 I 

=Gt at =G a 

which proves 1). Now 2) and 3) are easy. □ 

Theorem 2.3. The the time function of the Magnus breakdown algorithm on G(^i^2) 
is not bounded by any finite tower of exponents. 

Proof. Since the presentation (a, b \ b~^a~^bab~^ab = a^) for G(i,2) has a unique 
stable letter b in its relator, the Magnus procedure represents G(i.2) as the HNN 
extension 

G(i,2) = (S(i,2),6| 6-ia6 = i). 

Similarly, the presentation {^a,t \ t~^at = a^) has a unique stable letter t in its 
relator, the Magnus procedure represents -B(i.2) as the HNN extension of Z = (a) 

5(1,2) = {{a),t I t-'at^a^). 

Now, to determine if a given word w — w{a, b) represents the identity of G(i 2) the 
Magnus process applies the Britton's lemma to the constructed HNN extensions. 
The rewriting system TZ describes precisely the applications of the Britton's lemma 
to the word w, when one first eliminates all the pinches related to b and then all 
the pinches related to t. Independently of how one realizes the rewriting in Magnus 
breakdown (rewriting with TZ) in a deterministic fashion the rewriting of the words 
Wi of ([3]) is essentially unique and takes at least tower 2{i — 1) elementary rewrites 
to finish. Notice that the length of the word Wi is less than 2'+^ — 1 and, as Lemma 
12.21 shows, reducing the word Wi produces the word of length tower2{i) . Hence the 
result. □ 

2.3. Large scale rewriting in G(i 2)- To make the rewriting by TZ efhcient one 
must be able to: 

• work with huge numbers that appear in powers during the computations; 

• perform rewrites at bulk, i.e., perform many similar rewrites at once. 

In Section [5] we will use the rewriting system 

TZ' ^ {b-^a"'b i", ^ a" I m e N} 

, , U{t''a"' a"2"'i-'' I m e N, to2-'= G Z} 

Uix'^x"' t'^+" I /c, m e Z, X e {a, 6, t}} 

instead of the system ^ . To perform such rewrites efhciently one must be able to 
perform the following arithmetic operations: 

(01) addition and subtraction; 

(02) multiplication and division by a power of 2. 

In the next section we introduce a representation of integer numbers over which 
the sequences of operations (01) and (02) can be performed efficiently. 
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3. Power circuits 

In this section we define a presentation of integers wliicli we refer to as power 
circuit presentation and sliow how one can perforin some arithmetic operations over 
power circuits. See [23 for more details on circuits. 

A power circuit is a quadruple (V, fi, M, v) satisfying the conditions below: 

• V = {VI(P), E{V)) a directed graph with no multiple edges and no directed 
cycles; 

• fi : E{'P) — > {1, —1} a function called the edge labelling function; 

• M C V{V) a set of vertices called the set of marked vertices; 

• and i^:Af— >{ — l,l}a function called the sign function. 

For an edge e = tii — > U2 in P denote its origin vi by a{e) and its terminus V2 by 
/3(e). For a vertex v in V define sets 

luy — {e eV \ (5{e) = v} and Out^ = {e G "P | a(e) — v}. 

A vertex w in P is called an source if In^ — 0. Inductively define a function 
£ : V{V) — s> M (£ stands for evaluation) as follows: for v G V{V) define 

, r if Out, = 0; 

^(") " \ 2^=.o„*„ Ke)S[m) otherwise 

We are interested in presentations of integer numbers only and hence we assume 
that £(v) G Z for each v € V. Since V contains no cycles the function £ is well- 
defined. Finally, assign a number A/" to a defined quadruple {V, /i, M, ly) as follows 

A/- = AA(7',M,Af,z.)- J2 ^M^W- 

If AA = MIV, pL, M, v) then we say that {V, /i, Af, u) is a power circuit presentation 
of the number A/" G M, or that A/" is represented by ("P, /i, A/, j^). Throughout the 
paper we denote the quadruple (V, ^, M, v) simply by V. 



a) 




Figure 1. Power circuits representing integers 1, —1, 16, 2 and 
35. Black vertices denote the marked vertices. An edge e is labeled 
with + if /i(e) = 1, a marked vertex v is labeled with + if vi^v) = 1. 



For a circuit V denote by |P| the number |V^('P)| + \E{V)\ called the size of the 
circuit and by N{P) the integer represented by V. 
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3.1. Zero vertices in power circuits. A vertex z in 'P is called zero if Outz = 0. 
It follows from the definition of the function £ that z is a zero vertex in V if and 
only if £{z) = 0. Clearly, each non-trivial circuit has at least zero vertex. If V has 
more than one zero vertex then its size can be reduced. 

Lemma 3.1 (j23j). Let zi and Z2 be distinct zero vertices of a circuit V and V' a 
circuit obtained from V by gluing zi and Z2 together. Then \V{V)\ = \V{V') \ + 1 
andM{V)=M{V'). 

3.2. Addition and subtraction. Let Vi and V2 be two circuits. To compute a 
circuit V+ such that M{V+) — J^{Vi) + M{P2) one can take a union of Vi and 
■P2 leaving the labeling functions the same. Clearly the obtained result satisfies 
the equality A/'('P+) = N'iVi) -\- M{'P2)- Similarly, to compute a circuit V- such 
that N{V-) ~ J^{Vi) — MiJ'2) one can take a union of Vi and V2 leaving the 
labeling functions on Vi the same and changing the labeling function on M{V2) to 
the opposite. Clearly the obtained result satisfies the required equality. See Figure 
[5] for an example of difference of two circuits. 




Figure 2. Taking difference of circuits d) and e) from Figure [TJ 

Proposition 3.2 ([23 ). Let Vi and 'P2 be power circuits and V+ = Vi + V2- 
Then N{V+) = M{Vi) + M{V2), l^(^+)l = \V{Vi)\ + \V{V2% and \E{V+)\ = 
\E{Vi)\ + \E{r2)\- Moreover, V+ and V- are computed in time 0{\Vi\ + \V2\)- 

3.3. Comparison (circuit reduction). In this section we shortly describe the 
procedure called the reduction of power circuits. For the precise definition of a 
reduced circuit see |23j . The main property of reduced circuits is that £{vi) — £(v2) 
if and only if vi = V2- 

Theorem 3.3 ([23)). There exists an algorithm which for every power circuit V 
constructs an equivalent reduced circuit V' such that 

\V{V')\ < \V{V)\ + 1 and \M{V')\ < \MiP)\, 

and orders vertices ofV' according to their £ values. Moreover, the time complexity 
of the procedure is 0{\V{V)\^). 

Proposition 3.4 ( 23j). Let be a reduced circuit. Then N{V) = if and only 
if V has no marked vertices. If V is not trivial and if v is the vertex with maximal 
£ value then M{V) > if and only if v{v) — 1. 
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Proposition 3.5 ([23J). There exists a deterministic algorithm which for every 
power circuit V computes 

Sign{V) = <! 0, ' 



1, 



iiN{V) < 0; 
if AA(7') =0; 
if AA(7') > 0. 



Moreover, the time complexity of that procedure is bounded above by 0(|y(7^)|'^). 
Proposition 3.6 (^23]). Let "P be a reduced power circuit, {vi, . . . , w„} a set of all 



its vertices ordered according to their 8 values, and i 
U{V) is divisible by 2*=^ if and only if £{vi) is. 



min{j I Vj e Af}. Then 



3.4. Multiplication and division by a power of two. Let Vi and V2 be power 
circuits. Assume that M{V2) > 0. In this section we outline a procedure for 
constructing circuits "P, and Vo satisfying 

2AA(P2) 



and 



RecaU that 7V(Pi) = Ei,ga/i v{v)£{v), where £{v) = 2^<=eO"*" Ke)£We)) ig ^ po^er 
of 2. Hence, to multiply M{Vi) by 2-^(^2) 

one can multiply the values of £{v) 
by 2^*^^^) for each v G M('Pi) which corresponds to increase of the value of the 
sum EeeOut„ (/3(e)) by AA(p2)- Thus, to muhiply N{Vi) by 2^^^=) one can 

perform the following steps: 

(1) make each marked vertex w in "Pi a source; 

(2) take a union of Vi and V2] 

(3) for each vi G Adi and V2 G M2 add an edge e = I'l — > and put /i(e) = 
J^(^^2); 

(4) unmark marked vertices of P2- 
See Figure [3] for an example. 

In this paper we work with integer numbers only. Hence the operation o is not 
always defined for all pairs Pi, P2 of circuits. To check if Vi o P2 is defined one 
can reduce the presentation of Pi and check the conditions of Proposition l3.6l To 
actually multiply Pi by 2^^^'^^'^ one needs to 1) reduce Pi, 2) invert the value of 
P2 and, 3) apply the algorithm outlined above to compute Vo- 



r 

•I 



•I* 



'1/ * 



.1* 



Figure 3. Multiphcation by a power of 2. 
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Proposition 3.7. Let Vi and V2 be power circuits and, V» and Vo are obtained 
by the outlined above procedures. Tlien 

1) Mir,) = Mir 1)2^^''-) and A/-(Po) = 

2) \vir,)\, \viro)\ < \vir,)\ + \vir2)\ + m\. 

3) The time required to construct "P, is bounded by Od'Pil + 1^21)- 

4) The time required to construct T'o is bounded by 0{\Viri)\^ + 1^21 + ■ 
IM2I). 

Proof. Straightforward to check. □ 

4. Power sequences 

Let X be a group alphabet, a;i,...,x„ £ X^'^ , 'Pi,...,7'„ power circuits. A 
sequence S = (xi, "Pi), . . . , T'n) is called a power sequence. We say that a 
power sequence S represents a word 

The following characteristics of power sequences are used in our analysis in Section 
m We denote by AI{S) the total number of marked vertices in its circuits, i.e., 

(x,V)<£S 

and Vis) the total number of vertices in its circuits, i.e., 

vis)^ J2 \v{r)\. 

(x,v)es 

If S represents a word w — Xi^ . . . x^ and g — x^' . . . is a subword of w denote 
by Sg the segment of S corresponding to g. 

A power sequence is reduced if it does not contain 

(Rl) a pair ix,r) where Nir) = 0, 
(R2) a subsequence {x,r), {x,r'). 

To reduce a power sequence S one can consequently replace non-reduced subse- 
quences (a;, r), ix, r') by the corresponding pairs [x, r + 7-"), and remove the pairs 
(Rl). The described process is called a reduction of a power sequence. 

Proposition 4.1. Let 5 be a power sequence and S' be obtained by reducing S. 
Then S and S' represents the same element of the corresponding free group F{A). 
Furthermore, M(5') < M(5) and ViS') < ViS). The time complexity of reduction 
is not greater than 0(|y(5)|^). 

Proof. Follows from Proposition 13. 21 

□ 

5. Algorithm for the Word problem in 6(1^2) 

In this section we describe an algorithm for the Word problem in G(i 2) and prove 
that it has polynomial time complexity. All words in this section are processed in 
power sequences. More details are given in Section [5Tl In Section [5^ we give the 
final complexity estimate. 
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Algorithm 1. Word problem for 0(1.2) 
Input: A word w = w(a, b, t). 

Output: Yes if w represents tlie identity in G(^i^2), No otherwise. 
1: Represent w as a product of powers 

(5) w{a, b, t) = go{a, t)¥^ gi{a, t)b''^g2{a, t) . . . 6^"g„(a, t) 
where 

(6) g^{a,t) = a"'^-°t^^-^a"'^-H^^-'a"'^-^ . . .t^^-'-a""^-"' 

and ei,Sij,mij G Z. 
2: Compute a power sequence S representing w. 
3: while S contains a subsequence Sg- satisfying the follwoing do 
4: if Ei < 0, Ei+i > 0, and Sg. =b aP for some p £Z then 

5: Replace a subsequence (6, Si)Sg- (b, Ei+i) in S with (6, £,+1), {t,p), {b, e^+i — 

1). 

6: end if 

7: if Ei > 0, Ei+i < 0, and Sg. —st^ for some p £Z then 

8: Replace a subsequence {b,ei)Sg{b,ei+i) inS with (6,ei+l), (a,p), {b,Si+i — 

l). 

9: end if 
10: end while 

11: if The obtained S involves letter b then 
12: return No. 
13: end if 

14: if If the obtained sequence represents the trivial element in i?(i,2) then 
15: return Yes. 
16: else 

17: return No. 
18: end if 



A single transformation on line [5] and line |8] decreases the total power of 6 in a 
power sequence S by 2. Hence Algorithm[l]performs at most \w\/2 transformations. 
Now, it remains to describe a procedure for checking if Sg- =g of or Sg^ —g for 
some p e Z for some p G Z. This is done in Section [5Tl 

5.1. Word processing in S(i.2)- Let 

(7) {a,V^,),{t,Vs,),{a,Vm,),{t,Vs,),{a,Vm,) . . .{t, Vs, ) , (a, V,n, ) 

be a power sequence representing a word 

g = a""'t^'a"'H^'a'"' ...t^-a"" 
over the alphabet of 2)- 

Proposition 5.1 (All non-positive powers). Consider a sequence Q. Assume that 
di + . . . + Si < for every i = 1, . . . ,k. Then g — a^H"^ in 0(1^2) where 

k k 

cr = ^ (5, and M = ^m,- 2^^^=^ . 

1=1 i=0 
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Furthermore, there exist power circuits Vm and Va such that M{Vm) = M and 
M{Va) = o and 

(1) \y{V.)\ = E -=1 and \V{Vm)\ < E-=o(l^(^™.)l + \M{nm,)\) + 

(2) \M{V,)\ = Y!;=^ \M{Vs,)\ and \M{Vm)\ = E'=o 

Proof. The equahty g = a^^t*^ in G(i^2) is obvious. A circuit Va can be constructed 
by laying down circuits for Vs^ , • • • , Vs^ ■ A circuit Vm is obtained by 

• laying down the circuits Vmo , Vs^, ■ ■ ■ ,7^5^; 

• adding edges between Vrm and Vs- as it is done in multiplication by a power 
of two (see Section 13. 4p ; 

• removing marked vertices from Vs- 's. 

See Figured! Clearly Vm and Va satisfy properties (1) and (2). □ 



P3i \P32 



Pk 



Figure 4. Scheme for a circuit representing J2i=o 'Prm ° (j2]=i ^^j) • 



We call the transformation of Proposition 15.11 a (Tl)-transformation. Applying 
(Tl)-transforniations to subsequences of ([7]) we either obtain a power sequence 

(8) {a,VMo),{t,'PaJAa,VMj,. ■ . Aa,VM„^,),{t,'P^JAa,'PMj, 
where ai > for every i = 1, . . . , n; or a power sequence 

(9) (a, Vmo), {t, Va,), (a, Vm,), • • • , (a, Vm,^^J, [t, V^J, (a, VmJ, {t, 7'.„+ J, 

where Ui > Q for every i = 1, . . . , n and ct„+i < 0. Furthermore, for ([S]) there exist 
a sequence \ < Iq < li < . . . < In ~ k such that 

(10) Va, = + . . . + and Pm, = ^ ^m. o T^^^ 

i=/i-i v=;. 

Similar formulas hold for The next lemma follows from the Britton's lemma. 

Lemma 5.2. // a sequence (0) or represents in S(i,2) element or t^ for 
some p G Z then for every i = 1, . . . ,n the condition 

2-"'(. . . + 2-^-'{Mn-2 + 2-""-^(M„_i + 2-'^"Af„))) G Z 

is satisfied. 
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Proposition 5.3 (All positive powers). Consider a sequence (|8]) or (l9|) representing 
an element g in i?(i.2)- If for every z = 1, . . . , n the condition 

2-""(. . . + 2-""-^(A/„_2 + 2-'^"-i(M„_i + 2-"- Ma))) G Z 

is satisfied then g — a^^t°', where a — X]r=o ^« ^'^'^ 

M = (Afo + . . . (M„_2 + (Af„_i + Af„ o a„) o (T„_i) . . . o ai). 

Furthermore, there exist power circuits Vm and T'o- such that M{Vm) = M and 
N{'Pa) ~ cr satisfying 

(1) \v{v.)\ = EU IvIT'm)! < e;U(I^(^m,)| + \m{Vm,)\) + 

(2) |Af(P.)| = E;=i and |Af(nf)| = E -=0 I^^(^m,)|. 

Proof. The equality g = a^^f follows from the Britton's lemma. To construct a 
circuit Va we lay down circuits for Va^ , . . . , Va^ . Clearly, satisfies the properties 
(1) and (2). 

We construct a circuit Vm by induction on n. If ?i = then Vm — 'Pug and we 
have nothing to do. The case when n = 1 provides us with the induction step. If 
n = 1 then we need to construct a circuit representing A/q + 2"'^^ Afi. By (fTO|) we 
have (7i = 5q + . . . + 5i and Mq = mQ + mi2~^^ + . . . + to/_is~'^i - ~*'-i. A structure 
of circuits V^i and Vm^ was described in Proposition 15. II To construct Vm we 

• put down Vmo^'Psi and a reduced Vmi', 

• make all vertices in Vsi unmarked; 

• {'Pmo contains subgraphs corresponding to Vsg, . . . ,Vs,^i) add edges from 
marked vertices in Vm^ to vertices in Vsg , . • . , Vsi that were marked as for 
operation o; 

• collapse zero-vertices in the obtained graph. 

It follows from the construction that N{Vm) — M and that properties (1) and (2) 
hold for the constructed Vm- The scheme for Vm is given in Figure [5] □ 




Figure 5. Scheme for a circuit representing A/q + 2 "^^Mi. 



Proposition 5.4 (Complexity of processing in B(i 2))- It takes 
/ / . 



O 



J=0 j = l 
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operations to determine if ([7]) is equivalent to a sequence (a, V) or a sequence [t, V). 
If ([7]) is equivalent to a sequence (a, P) then V satisfies: 

\v{v)\ < E(i^(^"b)i + +E and = E 

J=0 j=l j=0 

If ([7]) is equivalent to a sequence (t, T') then V satisfies: 

A; k 

\V{V)\ = E and 1M(P)| - E 

j=i j=i 

Proof. The bounds on |V^(P)| and \M{V)\ for both cases follow from Propositions 
15.11 and 15.31 Furthermore, at every step in the process all power circuits in the 
sequence ^ have the number of vertices bounded by X)j=o(l^(^™3)l + l-^(^'"j)l) + 
J2j=i l^CPSj)]- Hence, it takes up to 

k k 

o\ |E(i^(^™.)i + i^-^(^™.)i)+Ei^(^*. 

yj=0 3 = 1 




operations to check if conditions of Propositions 15.11 and 15.31 hold at every step. 
The algorithm performs 0{k) transformations and hence the claimed bound on 
complexity. □ 



5.2. Complexity estimate for Algorithm [TJ 

Theorem 5.5. Algorithm,[J\ solves the Word problem for G (^1,2} in time 0(|w|^). 

Proof. Let w — w{a,b^t) be a reduced word over the alphabet {a, 6, t}. First, Al- 
gorithm [T] constructs a power sequence S for w. As described in |23| it is straight- 
forward to construct circuits for numbers mi,ei,Si. Clearly, the total number of 
vertices for circuits mi,ei,6i is not greater than 2\w\. This can be done in Oduij) 
steps. 

In the loop [51-[TU1 Algorithm [T] determines what subsequences Sg- can be shortened 
into {a^V) or {t,V). By Proposition 15.41 this can be done in time 

0{\9^\■i\V{Vg,)\ + \MiVg,)\f) 

and the obtained circuit V satisfies 

\V{r)\ < V{Sg^) + MiSg^) and \M{r)\ < Af(5gJ. 

Hence, a single transformation on a step [5] or [8] 

• does not increase the total number of marked vertices in 5; 

• can increase the total number of vertices by the number of marked vertices. 

Therefore, in the worst case steps [S] and |S] are performed on a sequence Sg^ of size 
^('S'si)l — Algorithm [1] performs up to \w\ steps [5] and 13 Hence the 

result. □ 
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